CAGNet: A Multi-Scale Convolutional Attention Method for Glass Detection Based on Transformer

نویسندگان

چکیده

Glass plays a vital role in several fields, making its accurate detection crucial. Proper prevents misjudgments, reduces noise from reflections, and ensures optimal performance other computer vision tasks. However, the prevalent usage of glass daily applications poses unique challenges for vision. This study introduces novel convolutional attention segmentation network (CAGNet) predicated on transformer architecture customized image detection. Based foundation our prior study, CAGNet minimizes number training cycles iterations, resulting enhanced efficiency. is built upon strategic design integration two types mechanisms coupled with head applied comprehensive feature analysis fusion. To further augment precision, incorporates custom edge-weighting scheme to optimize within images. Comparative studies rigorous testing demonstrate that outperforms leading methodologies detection, exhibiting robustness across diverse range conditions. Specifically, IOU metric improves by 0.26% compared previous presents 0.92% enhancement over those state-of-the-art methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images

The reconstruction of the information contaminated by cloud and cloud shadow is an important step in pre-processing of high-resolution satellite images. The cloud and cloud shadow automatic segmentation could be the first step in the process of reconstructing the information contaminated by cloud and cloud shadow. This stage is a remarkable challenge due to the relatively inefficient performanc...

متن کامل

Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection

Activity detection is a fundamental problem in computer vision. Detecting activities of different temporal scales is particularly challenging. In this paper, we propose the contextual multi-scale region convolutional 3D network (CMSRC3D) for activity detection. To deal with the inherent temporal scale variability of activity instances, the temporal feature pyramid is used to represent activitie...

متن کامل

Multi-Scale Fully Convolutional Network for Fast Face Detection

Image pyramid is a common strategy in detecting objects with different scales in an image. The computation of features at every scale of a finely-sampled image pyramid is the computational bottleneck of many modern face detectors. To deal with this problem, we propose a multi-scale fully convolutional network framework for face detection. In our detector, face models at different scales are tra...

متن کامل

Image Forgery Localization Based on Multi-Scale Convolutional Neural Networks

In this paper, we propose to use Multi-Scale Convolutional Neural Networks (CNNs) to conduct forgery localization in digital image forensics. A unified CNN architecture for input sliding windows of different scales is designed. Then, we elaborately design the training procedures of CNNs on sampled training patches in the IEEE IFS-TC Image Forensics Challenge training images. With a set of caref...

متن کامل

Multi-scale Cortical Keypoint Representation for Attention and Object Detection

Keypoints (junctions) provide important information for focus-of-attention (FoA) and object categorization/recognition. In this paper we analyze the multi-scale keypoint representation, obtained by applying a linear and quasi-continuous scaling to an optimized model of cortical end-stopped cells, in order to study its importance and possibilities for developing a visual, cortical architecture. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2023

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math11194084